1 <110> APPLICANT: Benner, Steven Albert

3 <120> TITLE OF INVENTION: Evolution-Based Functional Genomics
5 <130> FILE REFERENCE: file reference 10-765120

7 <140> CURRENT APPLICATION NUMBER: 10/765,120A

8 <141> CURRENT FILING DATE: 2004-01-28

10 <160> NUMBER OF SEQ ID NOS: 38

12 <170> SOFTWARE: Macintosh OS 10.3 Microsoft Word v. 2003

14 <210> SEQ ID NO: 1

15 <211> LENGTH: 486

16 <212> TYPE: PRT

17 <213> ORGANISM: Tilapia nilotica

19 <400> SEQUENCE: 1

20 Met Val Leu Glu Met Leu Asn Pro Met His Tyr Lys Val Thr Ser

21 5 10 15

22 Met Val Ser Glu Val Val Pro Phe Ala Ser Ile Ala Val Leu Leu

23 20 25 30

24 Leu Thr Gly Phe Leu Leu Leu Val Trp Asn Tyr Lys Asn Thr Ser

25 35 40 45

26 Ser Ile Pro Gly Pro Gly Tyr Phe Leu Gly Ile Gly Pro Leu Ile

27 50 55 60

28 Ser Tyr Leu Arg Phe Leu Trp Met Gly Ile Gly Ser Ala Cys Asn

29 65 70 75

30 Tyr Tyr Asn Lys Thr Tyr Gly Glu Phe Ile Arg Val Trp Ile Gly

31 80 85 90

32 Gly Glu Glu Thr Leu Ile Ile Ser Lys Ser Ser Ser Val Phe His

33 95 100 105

34 Val Met Lys His Ser His Tyr Thr Ser Arg Phe Gly Ser Lys Pro

35 110 115 120

36 Gly Leu Gln Phe Ile Gly Met His Glu Lys Gly Ile Ile Phe Asn

37 125 130 135

38 Asn Asn Pro Val Leu Trp Lys Ala Val Arg Thr Tyr Phe Met Lys

39 140 145 150

40 Ala Leu Ser Gly Pro Gly Leu Val Arg Met Val Thr Val Cys Ala

41 155 160 165

42 Asp Ser Ile Thr Lys His Leu Asp Lys Leu Glu Glu Val Arg Asn

43 170 175 180

44 Asp Leu Gly Tyr Val Asp Val Leu Thr Leu Met Arg Arg Ile Met

45 185 190 195

46 Leu Asp Thr Ser Asn Asn Leu Phe Leu Gly Ile Pro Leu Asp Glu

47 200 205 210

48 Lys Ala Ile Val Cys Lys Ile Gln Gly Tyr Phe Asp Ala Trp Gln

49 215 220 225

50 Ala Leu Leu Leu Lys Pro Asp Ile Phe Phe Lys Ile Pro Trp Leu

51 230 235 240

52 Tyr Arg Lys Tyr Glu Lys Ser Val Lys Asp Leu Lys Glu Asp Met

53 245 250 255

54 Glu Ile Leu Ile Glu Lys Lys Arg Arg Arg Ile Phe Thr Ala Glu

55 260 265 270

56 Lys Leu Glu Asp Cys Met Asp Phe Ala Thr Glu Leu Ile Leu Ala

57 275 280 285

58 Glu Lys Arg Gly Glu Leu Thr Lys Glu Asn Val Asn Gln Cys Ile

59 290 295 300

60 Leu Glu Met Leu Ile Ala Ala Pro Asp Thr Met Ser Val Thr Val

61 305 310 315

62 Phe Phe Met Leu Phe Leu Ile Ala Lys His Pro Gln Val Glu Glu

63 320 325 330

64 Glu Leu Met Lys Glu Ile Gln Thr Val Val Gly Glu Arg Asp Ile

65 335 340 345

66 Arg Asn Asp Asp Met Gln Lys Leu Glu Val Val Glu Asn Phe Ile

67 350 355 360

68 Tyr Glu Ser Met Arg Tyr Gln Pro Val Val Asp Leu Val Met Arg

69 365 370 375

70 Lys Ala Leu Glu Asp Asp Val Ile Asp Gly Tyr Pro Val Lys Lys

71 380 385 390

72 Gly Thr Asn Ile Ile Leu Asn Ile Gly Arg Met His Arg Leu Glu

73 395 400 405

74 Phe Phe Pro Lys Pro Asn Glu Phe Thr Leu Glu Asn Phe Ala Lys

75 410 415 420

76 Asn Val Pro Tyr Arg Tyr Phe Gln Pro Phe Gly Phe Gly Pro Arg

77 425 430 435

78 Ala Cys Ala Gly Lys Tyr Ile Ala Met Val Met Met Lys Val Thr

79 440 445 450

80 Leu Val Ile Leu Leu Arg Arg Phe Gln Val Gln Thr Pro Gln Asp

81 455 460 465

82 Arg Cys Val Glu Lys Met Gln Lys Lys Asn Asp Leu Ser Leu His

83 470 475 480

84 Pro Asp Glu Thr Ser Gly

85 485

87 <210> SEQ ID NO: 2 

88 <211> LENGTH: 486 

89 <212> TYPE: PRT 

90 <213> ORGANISM: Oryzias latipes 

92 <400> SEQUENCE: 2 

93 Met Phe Leu Glu Met Leu Asn Pro Met Gln Tyr Asn Val Thr Ile

94 5 10 15

95 Met Val Pro Glu Thr Val Thr Val Ser Ala Met Pro Leu Leu Leu

96 20 25 30

97 Ile Met Gly Leu Leu Leu Leu Ile Trp Asn Cys Glu Ser Ser Ser

98 35 40 45

99 Ser Ile Pro Gly Pro Gly Tyr Cys Leu Gly Ile Gly Pro Leu Ile

100 50 55 60

101 Ser His Gly Arg Phe Leu Trp Met Gly Ile Gly Ser Ala Cys Asn

102 65 70 75

103 Tyr Tyr Asn Lys Met Tyr Gly Glu Phe Met Arg Val Trp Ile Ser

104 80 85 90

105 Gly Glu Glu Thr Leu Ile Ile Ser Lys Ser Ser Ser Met Phe His

106 95 100 105

107 Val Met Lys His Ser His Tyr Ile Ser Arg Phe Gly Ser Lys Arg

108 110 115 120

109 Gly Leu Gln Cys Ile Gly Met His Glu Asn Gly Ile Ile Phe Asn

110 125 130 135

111 Asn Asn Pro Ser Leu Trp Arg Thr Ile Arg Pro Phe Phe Met Lys

112 140 145 150

113 Ala Leu Thr Gly Pro Gly Leu Val Arg Met Val Glu Val Cys Val

114 155 160 165

115 Glu Ser Ile Lys Gln His Leu Asp Arg Leu Gly Glu Val Thr Asp

116 170 175 180

117 Thr Ser Gly Tyr Val Asp Val Leu Thr Leu Met Arg His Ile Met

118 185 190 195

119 Leu Asp Thr Ser Asn Met Leu Phe Leu Gly Ile Pro Leu Asp Glu

120 200 205 210

121 Ser Ala Ile Val Lys Lys Ile Gln Gly Tyr Phe Asn Ala Trp Gln

122 215 220 225

123 Ala Leu Leu Ile Lys Pro Asn Ile Phe Phe Lys Ile Ser Trp Leu

124 230 235 240

125 Tyr Arg Lys Tyr Glu Arg Ser Val Lys Asp Leu Lys Asp Glu Ile

126 245 250 255

127 Ala Val Leu Val Glu Lys Lys Arg His Lys Val Ser Thr Ala Glu

128 260 265 270

129 Lys Leu Glu Asp Cys Met Asp Phe Ala Thr Asp Leu Ile Phe Ala

130 275 280 285

131 Glu Arg Arg Gly Asp Leu Thr Lys Glu Asn Val Asn Gln Cys Ile

132 290 295 300

133 Leu Glu Met Leu Ile Ala Ala Pro Asp Thr Met Ser Val Thr Leu

134 305 310 315

135 Tyr Phe Met Leu Leu Leu Val Ala Glu Tyr Pro Glu Val Glu Ala

136 320 325 330

137 Ala Ile Leu Lys Glu Ile His Thr Val Val Gly Asp Arg Asp Ile

138 335 340 345

139 Lys Ile Glu Asp Ile Gln Asn Leu Lys Val Val Glu Asn Phe Ile

140 350 355 360

141 Asn Glu Ser Met Arg Tyr Gln Pro Val Val Asp Leu Val Met Arg

142 365 370 375

143 Arg Ala Leu Glu Asp Asp Val Ile Asp Gly Tyr Pro Val Lys Lys

144 380 385 390

145 Gly Thr Asn Ile Ile Leu Asn Ile Gly Arg Met His Arg Leu Glu

146 395 400 405

147 Tyr Phe Pro Lys Pro Asn Glu Phe Thr Leu Glu Asn Phe Glu Lys

148 410 415 420

149 Asn Val Pro Tyr Arg Tyr Phe Gln Pro Phe Gly Phe Gly Pro Arg

150 425 430 435

151 Gly Cys Ala Gly Lys Tyr Ile Ala Met Val Met Met Lys Val Val

152 440 445 450

153 Leu Val Thr Leu Leu Arg Arg Phe Gln Val Lys Thr Leu Gln Lys

154 455 460 465

155 Arg Cys Ile Glu Asn Ile Pro Lys Lys Asn Asp Leu Ser Leu His

156 470 475 480

157 Pro Asn Glu Asp Arg His

158 485

160 <210> SEQ ID NO: 3

161 <211> LENGTH: 486

162 <212> TYPE: PRT

163 <213> ORGANISM: Danio rerio

165 <400> SEQUENCE: 3

166 Met Ile Leu Glu Met Leu Asn Pro Met His Tyr Asn Leu Thr Ser

167 5 10 15

168 Met Val Pro Glu Val Met Pro Val Ala Thr Leu Pro Ile Leu Leu

169 20 25 30

170 Leu Thr Gly Phe Leu Phe Phe Val Trp Asn His Glu Glu Thr Ser

171 35 40 45

172 Ser Ile Pro Gly Pro Gly Tyr Cys Met Gly Ile Gly Pro Leu Ile

173 50 55 60

174 Ser His Leu Arg Phe Leu Trp Met Gly Leu Gly Ser Ala Cys Asn

175 65 70 75

176 Tyr Tyr Asn Lys Met Tyr Gly Glu Phe Val Arg Val Trp Ile Ser

177 80 85 90

178 Gly Glu Glu Thr Leu Val Ile Ser Lys Ser Ser Ser Thr Phe His

179 95 100 105

180 Ile Met Lys His Asp His Tyr Ser Ser Arg Phe Gly Ser Thr Phe

181 110 115 120

182 Gly Leu Gln Tyr Met Gly Met His Glu Asn Gly Val Ile Phe Asn

183 125 130 135

184 Asn Asn Pro Ala Val Trp Lys Ala Leu Arg Pro Phe Phe Val Lys

185 140 145 150

186 Ala Leu Ser Gly Pro Ser Leu Ala Arg Met Val Thr Val Cys Val

187 155 160 165

188 Glu Ser Val Asn Asn His Leu Asp Arg Leu Asp Glu Val Thr Asn

189 170 175 180

190 Ala Leu Gly His Val Asn Val Leu Thr Leu Met Arg Arg Thr Met

191 185 190 195

192 Leu Asp Ala Ser Asn Thr Leu Phe Leu Arg Ile Pro Leu Asp Glu

193 200 205 210

194 Lys Asn Ile Val Leu Lys Ile Gln Gly Tyr Phe Asp Ala Trp Gln

195 215 220 225

196 Ala Leu Leu Ile Lys Pro Asn Ile Phe Phe Lys Ile Ser Trp Leu

197 230 235 240

198 Ser Arg Lys His Gln Lys Ser Ile Lys Glu Leu Arg Asp Ala Val

199 245 250 255

200 Gly Ile Leu Ala Glu Glu Lys Arg His Arg Ile Phe Thr Ala Glu

201 260 265 270

202 Lys Leu Glu Asp His Val Asp Phe Ala Thr Asp Leu Ile Leu Ala

203 275 280 285

204 Glu Lys Arg Gly Glu Leu Thr Lys Glu Asn Val Asn Gln Cys Ile

205 290 295 300

206 Leu Glu Met Met Ile Ala Ala Pro Asp Thr Leu Ser Val Thr Val

207 305 310 315

208 Phe Phe Met Leu Cys Leu Ile Ala Gln His Pro Lys Val Glu Glu

209 320 325 330

210 Ala Leu Met Lys Glu Ile Gln Thr Val Leu Gly Glu Arg Asp Leu

211 335 340 345

212 Lys Asn Asp Asp Met Gln Lys Leu Lys Val Met Glu Asn Phe Ile

213 350 355 360

214 Asn Glu Ser Met Arg Tyr Gln Pro Val Val Asp Ile Val Met Arg

215 365 370 375

216 Lys Ala Leu Glu Asp Asp Val Ile Asp Gly Tyr Pro Val Lys Lys

217 380 385 390

218 Gly Thr Asn Ile Ile Leu Asn Ile Gly Arg Met His Lys Leu Glu

219 395 400 405

220 Phe Phe Pro Lys Pro Asn Glu Phe Thr Leu Glu Asn Phe Glu Lys

221 410 415 420

222 Asn Val Pro Tyr Arg Tyr Phe Gln Pro Phe Gly Phe Gly Pro Arg

223 425 430 435

224 Ser Cys Ala Gly Lys Phe Ile Ala Met Val Met Met Lys Val Met

225 440 445 450

226 Leu Val Ser Leu Leu Arg Arg Phe His Val Lys Thr Leu Gln Gly

227 455 460 465

228 Asn Cys Leu Glu Asn Met Gln Lys Thr Asn Asp Leu Ala Leu His

229 470 475 480

230 Pro Asp Glu Ser Arg Ser

231 485

233 <210> SEQ ID NO: 4 

234 <211> LENGTH: 487 

235 <212> TYPE: PRT 

236 <213> ORGANISM: Carassius auratus 

238 <400> SEQUENCE: 4 

239 Val Leu Glu Leu Leu Met Gln Gly Ala His Asn Ser Ser Tyr Gly

240 5 10 15

241 Ala Gln Asp Asn Val Cys Gly Ala Met Ala Thr Leu Leu Leu Leu

242 20 25 30

243 Leu Leu Cys Leu Leu Leu Ala Ile Arg His His Trp Thr Glu Lys

244 35 40 45

245 Asp His Val Pro Gly Pro Cys Phe Leu Leu Gly Leu Gly Pro Leu

246 50 55 60

247 Leu Ser Tyr Cys Arg Leu Ile Trp Ser Gly Ile Gly Thr Ala Ser

248 65 70 75

249 Asn Tyr Tyr Asn Ser Lys Tyr Gly Asp Ile Val Arg Val Trp Ile

250 80 85 90

251 Asn Gly Glu Glu Thr Leu Ile Leu Ser Arg Ser Ser Ala Val Tyr

252 95 100 105

RAW SEQUENCE LISTING DATE: 10/27/2004

PATENT APPLICATION: US/10/765,120A TIME: 10:19:47

Input Set : A:\pto.lm.txt

Output Set: N:\CRF4\10272004\J765120A.raw

1 <110> APPLICANT: Benner, Steven Albert

3 <120> TITLE OF INVENTION: Evolution-Based Functional Genomics
5 <130> FILE REFERENCE: file reference 10-765120

7 <140> CURRENT APPLICATION NUMBER: 10/765,120A

8 <141> CURRENT FILING DATE: 2004-01-28
10 <160> NUMBER OF SEQ ID NOS: 38

12 <170> SOFTWARE: Macintosh OS 10.3 Microsoft Word v. 2003



ERRORED SEQUENCES

Does Not Comply
Corrected Diskette Needed

1488 <210> SEQ ID NO: 35

1489 <211> LENGTH: 84

1490 <212> TYPE: DNA

1491 <213> ORGANISM: Sus scrofa
1493 <400> SEQUENCE: 35

W--> 1494 caatcattac acgtgccgat ttggcagcaa acttgggttg gaatgcattg gcatgcatga 60
E--> 1495 atgtttaaca ataa 84

E--> 1515 <210> SEQ ID NO: 37

1516 <211> LENGTH: 84

1517 <212> TYPE: DNA

1518 <213> ORGANISM: White lipped peccary
E--> 1520 <400> SEQUENCE: 38

1521 cagtcactac acatcccgat tcggcagcaa acctgggttg cagttcattg gaatgcatga 60

1522 gaaaggcatc atatttaaca acaa 84
Invalid Line Length:

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:35; Line(s) 1494 
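
The 72-character rule above can be checked mechanically before a listing is submitted. The following is a minimal sketch in Python, not the checker that produced this report: the function name `find_overlong_lines` and the sample lines are hypothetical, and only the limit of 72 characters (spaces included) is taken from the rule as stated.

```python
MAX_LINE_LENGTH = 72  # limit stated in the rule above; spaces count


def find_overlong_lines(lines, limit=MAX_LINE_LENGTH):
    """Return (line_number, length) for every line longer than `limit`.

    Line numbers are 1-based. Trailing newline characters are not
    counted, but every other character, including spaces, is.
    """
    offenders = []
    for number, line in enumerate(lines, start=1):
        length = len(line.rstrip("\r\n"))
        if length > limit:
            offenders.append((number, length))
    return offenders


# Hypothetical sample: one line exactly at the limit, one a character over.
sample = ["a" * 72 + "\n", "c" * 73 + "\n"]
print(find_overlong_lines(sample))  # [(2, 73)]
```

Run against the text file named under "Input Set", a check of this kind would be expected to list the same line the report flags, provided the file still has its original spacing.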
VERIFICATION SUMMARY DATE: 10/27/2004

PATENT APPLICATION: US/10/765,120A TIME: 10:19:48

Input Set : A:\pto.lm.txt 
L:1494 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 8

L:1495 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID:35

L:1495 M:254 No. of Bases conflict, LENGTH: Input: 84 Counted: 14 SEQ: 35

L:1495 M:252 No. of Seq. differs, <211> LENGTH: Input: 84 Found:14 SEQ:35

L:1515 M:212 (34) Invalid or duplicate Sequence ID Number, SEQ ID NO: 37

L:1520 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQUENCE ID NOS:37 differs:38
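
The length conflicts reported above compare the declared <211> LENGTH with the number of bases actually counted. The sketch below illustrates that comparison in Python. The helper names `count_bases` and `length_conflict` are hypothetical, and reading "Counted: 14" as the bases on line 1495 alone is an assumption, since the report does not say how the count was made.

```python
def count_bases(sequence_lines):
    """Count nucleotide letters, ignoring spaces and position numbers."""
    return sum(ch.isalpha() for line in sequence_lines for ch in line)


def length_conflict(declared_length, sequence_lines):
    """Return (declared, counted) when they differ, otherwise None."""
    counted = count_bases(sequence_lines)
    if counted == declared_length:
        return None
    return (declared_length, counted)


# Sequence lines as they appear in the listing (line 1495, then 1521-1522).
line_1495 = "atgtttaaca ataa 84"
seq_38 = [
    "cagtcactac acatcccgat tcggcagcaa acctgggttg cagttcattg gaatgcatga 60",
    "gaaaggcatc atatttaaca acaa 84",
]

print(count_bases([line_1495]))          # 14
print(length_conflict(84, [line_1495]))  # (84, 14)
print(length_conflict(84, seq_38))       # None
```

Under that assumption, the 14 bases on line 1495 match the counted value in the report, while the two lines of the final sequence add up to the declared 84.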